Flexible Analysis of Plant Genomes in a Database Management System
نویسندگان
چکیده
Analysis of genomes has a wide range of applications from disease susceptibility studies to plant breeding research. For example, di↵erent types of barley have di↵ering characteristics regarding draught or salt tolerance. Thus, a typical use case is comparing two plant genomes and try to deduce which genes are responsible for a certain resistance. For this, we need to find di↵erences in large volumes of aligned genome data, which is already available in large genome databases. The challenge is to e ciently retrieve the genotypes of a certain range of the genome, and then, to determine variants and their impact on the plant organism. State-of-the-art tools are fixed pipelines with a fixed parametrization. However, in practice, users want to interactively analyse genome data and need to customize the parametrization. In this demonstration, we show how we can support flexible ad-hoc analyses of arbitrary plant genomes using SQL with a small set of user-defined aggregation functions and dynamic parametrization. Furthermore, we demonstrate how genome analysis workflows for variant calling can be applied to our system and provide insights about the performance of our system.
منابع مشابه
Designing a Bank-Based Flexible Performance Evaluation System (Study: Bank Shahr)
Given the limitations of the existing performance evaluation models for organizations with dynamic internal and external conditions, this study aims to provide a flexible performance evaluation model with adaptability to intra- and extra-organizational changes. The present study first forms a database of criteria related to banking activities. After gathering the experts' opinions, we select 2...
متن کاملPlant genome resources at the national center for biotechnology information.
The National Center for Biotechnology Information (NCBI) integrates data from more than 20 biological databases through a flexible search and retrieval system called Entrez. A core Entrez database, Entrez Nucleotide, includes GenBank and is tightly linked to the NCBI Taxonomy database, the Entrez Protein database, and the scientific literature in PubMed. A suite of more specialized databases fo...
متن کاملMultiobjective Retuning the Power System Stabilizer (PSS) of a Real Power Plant in Iran Grid
The safe operation of power system depends on its stability and security supply in all times. The dynamic instability (small signal instability) is one of phenomena that results in power system instability and has been discussed as a challenge in power system control and operation from a long time ago. Commonly the dynamic instability appears as undamped low frequency electromechanical oscillat...
متن کاملAssessment of genetic diversity in Iranian wheat (Triticum aestivum L.) cultivars and lines using microsatellite markers
In this study, genetic diversity of 20 wheat genotypes was evaluated using 126 simple sequence repeats (SSR) alleles, covering all three wheat genomes. A total of 1557 allelic variants were detected for 126 SSR loci. The number of alleles per locus ranged from 4 to 19 and the allelic polymorphism information content (PIC) varied from 0.66 (Xgwm429) to 0.94 (Xgwm212 and Xgw...
متن کاملAn Optimal Preventive Maintenance Model to Enhance Availability and Reliability of Flexible Manufacturing Systems
General preventive maintenance model for the components of a system, which improves the reliability to ‘as good as new,’ was used to optimize the maintenance cost. The cost function of a maintenance policy was minimized under given availability constraint. On the other hand, in order to ensure appropriate reliability and availability, the development of the optimal maintenanc...
متن کامل